-
Re: African languages word lists
Octo-text is a good starting point and we should keep updating it.There is also corpus crawler data Google collected a few years ago (https://github.com/google/corpuscrawler) with files with words an…2 -
Re: Escape accented characters in Font Info?
@"Igor Freiberger" The Mac Roman encoding needs to use Mac Roman values, not Unicode UTF-16 values. So in your case ú is \00FA in UTF-16 and \9C in Mac Roman. I’m not sure anyone needs to u…3 -
Re: Escape accented characters in Font Info?
According to the spec (8c and 9e), OT Feature should take UTF-8 and convert it for you, so you shouldn't have to escape Ä: featureNames { # Windows, Unicode BMP, English (same as default) name 3 1 0x…3 -
Re: Check language support tools
@"Johannes Neumeier" The argument for Ǥ, Ʒ, Ǯ, ǥ, ʒ, ǯ in names doesn’t seem to hold when looking at corpora actually, they pretty much do not occur as the norm is to use translated forms. …3 -
Re: Check language support tools
The Unicode CLDR auxiliary exemplar character data is defined as "Additional characters for common foreign words, technical usage" and has the following description: SIL’s SLDR, Rosetta’s H…1